Variance Estimation for Finite Populations with Imputed Data
نویسندگان
چکیده
One way of handling survey nonresponse is to impute data for each nonrespondent. When estimating sampling variances, however, treating the imputed data as a complete set frequently leads to underestimates of the true sampling variance. Techniques have been recently developed to yield valid variance estimates in the presence of imputed data for some estimators and sample designs. Economic surveys frequently deal with highly skewed populations and employ high sampling rates, including selection with certainty for large units. It is also common for economic surveys to have administrative or historical data available for use in imputation. This paper describes a Monte Carlo study of variance estimation on skewed populations with high sampling fractions in some strata. We examine a variety of imputation techniques and patterns of nonresponse. We extend the Rao-Shao technique to finite populations and nearest neighbor imputation, and compare the resulting estimators to the true variance.
منابع مشابه
Variance Estimation for Imputed Survey Data with Non-negligible Sampling Fractions
We consider variance estimation for Horvitz-Thompson type estimated totals based on survey data with imputed nonrespondents and with non-negligible sampling fractions. A method based on a variance decomposition is proposed. Our method can be applied to complicated situations where a composite of some deterministic and/or random imputation methods is used, including using imputed data to impute....
متن کاملImputation of Missing Data for the Pre-Elementary Education Longitudinal Study
In the Pre-Elementary Education Longitudinal Study (PEELS), imputation of item missing data was done using AutoImpute (AI) software, which uses semi-parametric modeling to form imputation classes. In this paper, we summarize PEELS experience with AI, investigate the bias aspect of the imputed data for the PEELS teacher questionnaire data, and study the variance estimation of imputed data using ...
متن کاملBalanced Repeated Replication for Stratified Multistage Survey Data under Imputation
Balanced repeated replication (BRR) is a popular method for variance estimation in surveys. The standard BRR method works by rst creating a set of \balanced" pseudo-replicated data sets from the original data set. For a survey estimator ^ , the BRR variance estimator is the average of squared deviations ^ (r) ? ^ , where ^ (r) is the same as ^ but based on the data in the rth pseudo-replicated ...
متن کاملFractional hot deck imputation
To compensate for item nonresponse, hot deck imputation procedures replace missing values with values that occur in the sample. Fractional hot deck imputation replaces each missing observation with a set of imputed values and assigns a weight to each imputed value. Under the model in which observations in an imputation cell are independently and identically distributed, fractional hot deck impu...
متن کاملSelection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets
Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995